Improved a Priori SNR Estimation for Speech Enhancement Incorporating Speech Distortion Component

نویسندگان

  • Shifeng Ou
  • Chao Geng
  • Ying Gao
چکیده

The well known decision-directed (DD) approach drastically limits the level of musical noise, but the estimated a priori SNR matches the previous frame rather than the current one. Plapous introduced a novel method called two-step noise reduction (TSNR) technique to refine the a priori SNR estimation of the DD approach. However, the performance of this method depends on the accurateness of the estimated speech in its second step. In this paper, we propose an improved approach for the a priori SNR estimation in DCT domain with two steps like the TSNR method. While in the second step, considering the two state components of the estimation error between speech signal and its estimation, the speech distortion component and residual noise component, we make the estimated speech subtracted by its speech distortion as a refined estimation for the clean speech signal. Because the speech distortion component is offset, the estimated a priori SNR is more accurate. A number of objective tests results show the improved performance of the proposed approach.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Enhancement Using Gaussian Mixture Models, Explicit Bayesian Estimation and Wiener Filtering

Gaussian Mixture Models (GMMs) of power spectral densities of speech and noise are used with explicit Bayesian estimations in Wiener filtering of noisy speech. No assumption is made on the nature or stationarity of the noise. No voice activity detection (VAD) or any other means is employed to estimate the input SNR. The GMM mean vectors are used to form sets of over-determined system of equatio...

متن کامل

A Novel Frequency Domain Linearly Constrained Minimum Variance Filter for Speech Enhancement

A reliable speech enhancement method is important for speech applications as a pre-processing step to improve their overall performance. In this paper, we propose a novel frequency domain method for single channel speech enhancement. Conventional frequency domain methods usually neglect the correlation between neighboring time-frequency components of the signals. In the proposed method, we take...

متن کامل

Reducing over- and under-estimation of the a priori SNR in speech enhancement techniques

a r t i c l e i n f o a b s t r a c t A priori SNR A posteriori SNR SNR cells Spectral distortion Most speech enhancement methods based on short-time spectral modification are generally expressed as a spectral gain depending on the estimate of the local signal-to-noise ratio (SNR) on each frequency bin. Several studies have analyzed the performance of a priori SNR estimation algorithms to impro...

متن کامل

Speech enhancement based on hidden Markov model using sparse code shrinkage

This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...

متن کامل

Speech enhancement with improved a posteriori SNR computation

In speech enhancement, the decision-directed (DD) approach to compute the a priori SNR is often used to reduce the musical tones. However, the constant DD weighting factor very close to one results in more speech distortion during transitional speech segments. Contrarily, a time-varying weighting factor gives less speech distortion but with more residual noise in speech pause. In this contribut...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013